A decision-theoretic saliency, its biological plausibility and implications for pre-attentive vision
نویسندگان
چکیده
A decision-theoretic formulation of visual saliency, first proposed for top-down processing (object recognition) in (Gao & Vasconcelos, 2005) is extended to the problem of bottom-up saliency. Under this formulation, optimality is defined in the minimum probability of error sense, under a constraint of computational parsimony. The saliency of the visual features at a given location of the visual field is defined as the power of those features to discriminate between the stimulus at the location and a null hypotheses. For bottom-up saliency, this is the set of visual features that surround the location under consideration. Discrimination is defined in an information-theoretic sense and the optimal saliency detector derived for a class of stimuli that complies with known statistical properties of natural images. It is shown that the optimal detector consists of what is usually referred to as the standard architecture of V1 (Carandini, Demb, Mante, Tolhurst, Dan, Olshausen, et al., 2005): a cascade of linear filtering, divisive normalization, rectification and spatial pooling. The optimal detector is also shown to replicate the fundamental properties of the psychophysics of saliency (Treisman & Gelade, 1980): stimulus pop-out, saliency asymmetries for stimulus presence vs. absence, disregard of feature conjunctions, and Weber’s law. Finally, it is shown that the optimal saliency architecture can be applied to the solution of generic inference problems. In particular, for the class stimuli studied, it performs the three fundamental operations of statistical inference: assessment of probabilities, implementation of Bayes decision rule, and feature selection.
منابع مشابه
On the plausibility of the discriminant center-surround hypothesis for visual saliency.
It has been suggested that saliency mechanisms play a role in perceptual organization. This work evaluates the plausibility of a recently proposed generic principle for visual saliency: that all saliency decisions are optimal in a decision-theoretic sense. The discriminant saliency hypothesis is combined with the classical assumption that bottom-up saliency is a center-surround process to deriv...
متن کاملDecision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics
A decision-theoretic formulation of visual saliency, first proposed for top-down processing (object recognition) (Gao & Vasconcelos, 2005a), is extended to the problem of bottom-up saliency. Under this formulation, optimality is defined in the minimum probability of error sense, under a constraint of computational parsimony. The saliency of the visual features at a given location of the visual ...
متن کاملTop-down weighting of visual dimensions: Behavioral and electrophysiological evidence
Visual search for an odd-one-out target is speeded when observers are provided with a cue word indicating the most probable target-defining dimension (e.g., form) on a given trial (Müller, Reimann, & Krummenacher, 2003). According to the 'dimension-weighting' account (e.g., Müller, Heller, & Ziegler, 1995), this semantic cueing effect originates from a pre-attentive processing stage: the coding...
متن کاملBiologically plausible saliency mechanisms improve feedforward object recognition
The biological plausibility of statistical inference and learning, tuned to the statistics of natural images, is investigated. It is shown that a rich family of statistical decision rules, confidence measures, and risk estimates, can be implemented with the computations attributed to the standard neurophysiological model of V1. In particular, different statistical quantities can be computed thr...
متن کاملAttentive Mechanisms & Their Relevance to Random Access Computer Vision
Research into the use of random access sensors in an industrial context has been motivated by the need to reduce the bottleneck in image acquisition time. With the rapid increase in processor speeds of recent years, the acquisition of the image from camera sensor to machine memory has become the slowest process in the machine vision system. Random access sensors provide speedy direct access to ...
متن کامل